Scaling transactional workloads on the cloud
نویسنده
چکیده
In this paper, we address the problem of transparently scaling out transactional (OLTP) workloads on relational databases, to support database-as-a-service in cloud computing environment. The primary challenges in supporting such workloads include choosing how to partition the data across a large number of machines, minimizing the number of distributed transactions, providing high data availability, and tolerating failures gracefully. Capturing and modeling the transactional workload over a period of time, and then exploiting that information for data placement and replication has been shown to provide significant benefits in performance, both in terms of transaction latencies and overall throughput. However, such workload-aware data placement approaches can incur very high overheads, and further, may perform worse than naive approaches if the workload changes. In this work, we propose SWORD, a scalable workload-aware data partitioning and placement approach for OLTP workloads, that incorporates a suite of novel techniques to significantly reduce the overheads incurred both during the initial placement, and during query execution at runtime. We model the workload as a hypergraph over the data items, and propose using a hypergraph compression technique to reduce the overheads of partitioning. We have built a workload-aware active replication mechanism in SWORD to increase availability and enable load balancing. We propose the use of fine-grained quorums defined at the level of groups of tuples to control the cost of distributed updates, improve throughput, and provide adaptability to different workloads. To our knowledge, SWORD is the first system that uses fine-grained quorums in this context. The results of our experimental evaluation on SWORD deployed on an Amazon EC2 cluster show that our techniques result in orders-of-magnitude reductions in the partitioning and bookkeeping overheads, and improve tolerance to failures and workload changes; we also show that choosing quorums based on the query access patterns enables us to better handle query workloads with different read and write access patterns.
منابع مشابه
An Efficient Oblivious Database for the Public Cloud
We present ObliDB, a secure SQL database for the public cloud that supports both transactional and analytics workloads and protects against access pattern leakage. With databases being a critical component in many applications, there is significant interest in outsourcing them securely. Hardware enclaves offer a strong practical foundation towards this goal by providing encryption and secure ex...
متن کاملScaling Up Mixed Workloads: A Battle of Data Freshness, Flexibility, and Scheduling
The common “one size does not fit all” paradigm isolates transactional and analytical workloads into separate, specialized database systems. Operational data is periodically replicated to a data warehouse for analytics. Competitiveness of enterprises today, however, depends on real-time reporting on operational data, necessitating an integration of transactional and analytical processing in a s...
متن کاملRelational Cloud: a Database Service for the cloud
This paper introduces a new transactional “database-as-a-service” (DBaaS) called Relational Cloud. A DBaaS promises to move much of the operational burden of provisioning, configuration, scaling, performance tuning, backup, privacy, and access control from the database users to the service operator, offering lower overall costs to users. Early DBaaS efforts include Amazon RDS and Microsoft SQL ...
متن کاملTime Series Forecasting of Cloud Data Center Workloads for Dynamic Resource Provisioning
Cloud computing offers on-demand, elastic resource provisioning that allows an enterprise to provide services to their customers at an acceptable quality while consuming only the requisite computing resources as a utility. Since cloud computing resources scale elastically, utilizing cloud computing reduces the risk of over-provisioning, wasting resources during non-peak hours, and reduces the r...
متن کاملMixed Batch and Transactional Workloads for Cloud Computing Jobs
In this mixed batch and transactional workloads for cloud computing jobs we implemented a technique that manages a long running jobs and OLTP it contains mixed workloads of all the types like word, video and image. In this process job scheduler plays an important role, it is assigned for managing workloads and also is an application for controlling non viewing or unattended background program p...
متن کامل